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(54) Interlaced video signal encoding and decoding method, using conversion of periodically 
selected fields to progressive scan frames 



(57) An encoding apparatus includes a selector (8) 
fa periodically selecting fields of an interlaced video 
signal to be converted to respective progressive scan- 
ning frames, by a scanning converter (1) which doubles 
the number of scanning lines per field. The apparatus 
encodes these frames by intra-frame encoding or unidi- 
rectional predictive encoding using preceding ones of 
the frames, and encodes the remaining fields of the 
video signal by bidirectional prediction using preceding 
and succeeding ones of the progressive scanning 
frames for reference. The resultant code can be 



decoded by an inverse process to recover the interlaced 
video signal, or each decoded field can be converted to 
a progressive scanning frame to thereby enable output 
of a progressive scanning video signal. Enhanced accu- 
racy of motion prediction for inter-frame encoding can 
thereby be achieved, and generation of encoded alias- 
ing components suppressed, since all reference pic- 
tures are progressive scanning frames rather than pairs 
of fields constituting interlaced scanning frames. 
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Description 

Field of the Invention 

The present invention relates to a method and s 
apparatus for high-efficiency encoding to efficiently con- 
vert an interlaced type of video signal into a stream of 
compressed code, for the purpose of transmission or 
storage. In particular, the invention relates to encoding 
processing which uses bidirectional motion prediction 
encoding, applied to an interlaced type of video signal. 

!n the following, the term "picture" will be used as a 
general term for referring to the contents of afield of an 
interlaced video signal, or a frame of aprogressive scan- 
ning (i.e., non-interlaced] type of video signal. 

Description of the Prior Art 

A method of high-efficiency video encoding for an 
interlaced type of video signal is known whereby one in 
every m successive frames (where m is an integer of 2 
or more) is encoded either independently by internal 
encoding or by unidirectional predictive encoding, while 
the remaining frames (referred to as the B frames)are 
encoded by bidirectional predictive encoding using pre- 
ceding and succeeding ones of the aforementioned 
specific frames (i.e., I or P frames). 

Such predictive encoding of a video signal is now 
well known in the art, being described for example in 
Japanese patent laid-open number HEI 2-192378, of 
the assignee of the present invention, etc. The tech- 
nique is also used with the MPEG-1 system (ISO/IEC- 
1 1 1 72), and the MPEG-2 system (ISO/IEC-13818). 

It can be understood from the above that with such 
a method three different picture types are established, 
in accordance with the form of encoding that is 
applied, i.e. the I frames, the P frames and the B frames, 
and that only the I frames and P frames are used as ref- 
erence frames for encoding. 

Assuming that the frame period is 1/30 second, the 
first and second fields of each frame of the interlaced 
video signal are time-displaced by 1/60 second, and are 
also mutually displaced by one scanning line position, in 
the vertical direction of the picture. As a result, appropri- 
ate prediction for the purpose of picture encoding can- 
not be achieved by using a simple inter-frame prediction 
method. In such a case, a method is used such as with 
the MPEG-2 standard etc.. whereby processing is per- 
formed in units of fields, with a plurality of fields being 
used to constitute a reference picture, or whereby 
processing is basically performed in units of frames, but 
with prediction being switched to perform local predic- 
tion in units of fields when necessary. 

In particular, in the case of the MPEG-2 stand- 
ard.each of the aforementioned picture types (i.e., I, P, 
B)must be established in units of frames. That is to 
say.even if prediction processing is performed in units of 
fields, the l-pictures and B-pictures must each be set as 
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respective consecutive pairs of fields. 

Whichever method is used, when motion is 
detected between the contents of successive fields, pre- 
diction is performed in units of fields. In that case, due to 
the scanning line configuration of the interlaced signal, 
large amounts of aliasing components will be generated 
and encoded, due to the one-line vertical displacement 
between successive interlaced fields. As a result, even 
if the picture motion only consists of a parallel shift of 
the picture, comparatively large amounts of prediction 
error values will be generated and encoded, so that the 
predictive encoding process will generate excessively 
large amounts of code, i.e. the objective of achieving a 
high efficiency of data compression through encoding 
will not be met. 

Fig. 5 shows an example of the configuration of a 
prior art type of video encoding apparatus which uses 
bidirectional prediction for encoding the B fields. It will 
be assumed that prediction is performed in units of 
fields, but that the I, P and B picture types are estab- 
lished in units of interlaced frames as described above. 

The interlaced video signal which is input to the 
video input terminal 7 is supplied to the input signal 
selection switch 56 which is controlled to operate in syn- 
chronism with successive fields of the input video signal 
such that the I and P frames are supplied to a subtracter 
51 while the B frames are supplied to a frame delay ele- 
ment 61. It should be noted that the term "video signal" 
as used herein signifies a digital video signal. 

One out of every m successive frames of the input 
video signal is selected as an I or a P frame, (where m 
will in general have a value of 2 or 3). The proportion of 
I frames to P frames is a matter of design choice. The 
subtracter 51 subtracts an inter-picture prediction signal 
(i.e., consisting of successive predicted values for 
respective pixels of a frame) that is produced by an 
inter-picture prediction section 57 from the I or P frame 
signal which is supplied thereto, and supplies the result- 
ant difference values, i.e. prediction error values, to a 
DCT section 52. The DCT section 52 performs DCT 
(Discrete Cosine Transform) conversion processing on 
successive sets of prediction error values which corre- 
spond to respective blocks of 8 x 8 (or 1 6 x 1 6) pixels of 
a picture, and the transform coefficients thereby 
obtained are supplied to a quantizer 53. The quantizer 
53 performs quantization of the coefficients, using a 
predetermined quantization step size, and the resultant 
fixed-length encoded coefficients are supplied to a vari- 
able-length encoder 54 and to a dequantizer 55. 

The variable-length encoder 54 performs array con- 
version of the 2-dimensional 8x8 sets of coefficients 
into a 1 -dimensional sequence, using zig-zag sequence 
processing, and encodes the result by Huffman encod- 
ing, i.e. using the numbers of runs of coefficient values 
of zero or of coefficient values other than zero. The 
resultant code sequences into which the I and P frames 
have been respectively converted are multiplexed with 
the code sequences which are obtained for the B 
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frames, by the multiplexer 13, and the resultant code 
stream is supplied to the code output terminal 1 4. 

The dequantizer 55 and the inverse DCT section 60 
perform the inverse processing to that of the quantizer 
and the DCT section 52, to thereby reproduce the inter- s 
picture prediction error values, and the values thus 
obtained are added to the prediction signal by the adder 
59, to obtain values expressing successive recon- 
structed pictures, which are supplied to the picture 
memory 58. The reconstructed pictures which are thus 
stored in the picture memory 58 are thereafter read out 
and supplied to the inter-picture prediction section 57 at 
appropriate timings. 

The inter-picture prediction section 57 generates 
different prediction signals in accordance with respec- 
tive types of picture (i.e., I, P or B), supplies the predic- 
tion signals derived for the I and P frames to the 
subtracter 51, and supplies the prediction signals 
derived for the B frames to one input of the subtracter 
17. 

Since no prediction is performed for an I frame, the 
prediction signal values for an I frame are always zero. 
In the case of a P frame, the prediction signal is 
obtained based on a preceding I or P frame. In the case 
of a B frame, the prediction signal is obtained based on 
preceding and succeeding I or P frames. 

With this method, since prediction is performed in 
units of fields, both even fields and odd fields of a recon- 
structed frame may be used as reference pictures. Of 
these, the field which results in the smallest amount of 
prediction error is used as a reference for deriving the 
prediction signal. 

When a B frame signal is selected by the switch 56, 
the frame delay section 61 applies a delay of (m - 1) 
frames, and the delayed B frame signal is then supplied 
to the subtract or 17. Since the picture type is estab- 
lished in units of frames, the delay must be established 
in units of frame periods. The resultant delayed picture 
signal (i.e. successive pixel values) is input to the sub- 
tractor 1 7 in synchronism with predicted values supplied 
from the inter-picture prediction section 57, to obtain 
respective prediction error values for the B frame, which 
are encoded by the DCT section 18, quantizer 19 and 
variable-length encoder 20, in the same way as for the 
DCT section 52, quantizer 53 and variable-length 
encoder 54. 

Since the B frames are not used as reference 
frames for inter-picture prediction, the encoding system 
for the B frames does not contain any local decoding 
section. 

In Fig. 5, the processing system for the I and P 
frames, and the processing system for the B frames, are 
respectively separate. However, since the processing 
which is performed by these is basically similar (folllow- 
ing the subtraction stage), it would be equally possible 
to execute all of the processing by a single system, 
through appropriate time-sharing operation. 

The code sequences obtained for the B frames are 
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multiplexed by the multiplexer 13 with the code 
sequences derived for the I and P frames in a different 
order from that of the order of the frames of the original 
input video signal. That is to say, the order must be 
changed such as to ensure that the code sequence for 
each B frame will not be transmitted from the code out- 
put terminal 14 until after the code sequences for the I 
or P frames which were used in predictive encoding of 
that B frame have been transmitted. 

A video decoding apparatus corresponding to the 
video encoding apparatus of Fig. 5 will be described in 
the following, referring to the system block diagram of 
Fig. 6. In Fig. 6, an input code stream (i.e., consisting of 
successive code sequences for respective I, P, B pic- 
tures which have been generated by the video encoding 
apparatus of Fig. 5) is supplied to a code input terminal 
33, to be separated by a demultiplexer 34 into the code 
sequences for the I and P frames and the code 
sequences for the B frames. 

The code sequences for the I and P frames are 
supplied to a variable-length decoder 62, to be restored 
to fixed code length format, then reconstructed predic- 
tion error values for a frame are derived by a dequan- 
tizer 75 and inverse DCT section 80, and added to 
predicted values for that frame by an adder 79, to obtain 
pixel values expressing reconstructed pictures, which 
are stored in a picture memory 63. 

The inter-picture prediction section 64 generates 
prediction signals and supplies these to the adder 79 (in 
the case of the I and P frames) and to the adder 41 (in 
the case of the B frames). The inter-picture prediction 
section 64 differs in operation from the inter-picture pre- 
diction section 57 in that it does not perform motion esti- 
mation or prediction mode selection, and operates only 
in accordance with the transmitted information, so that 
the amount of processing which is executed is much 
less than that executed by the inter-picture prediction 
section 57. 

The code sequences for the B frames are decoded 
by the variable-length decoder 38, dequantizer 39, and 
inverse DCT section 40 to obtain reconstructed predic- 
tion error values, which are added to the corresponding 
predicted pixel values by an adder 41 , to thereby obtain 
reconstructed B frames. 

The output selection switch 42 selects the values 
for the reconstructed I and P frames, read out from the 
picture memory 63, and the reconstructed B frames, 
produced from the adder 41 , to be supplied to the pic- 
ture output terminal 43. This is executed such that the 
order in which the sets of values for respective frames 
are supplied to the picture output terminal 43 is identical 
to the picture sequence of the original video signal (prior 
to encoding), rather than the order in which the encoded 
data sequences for the frames are output from the video 
encoding apparatus. 

The pixel values for each frame are supplied to the 
output selection switch 42 from from the picture memory 
63. or obtained from the adder 41 , as a set of values for 
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the first field followed by a set of values for the second 
field of the frame, so that an interlaced video signal is 
obtained from output terminal 43. 

With a prior art type of interlaced video encoding 
apparatus, the problem arises that even if there is only a s 
small amount of picture motion, it becomes impossible 
to perform accurate inter-picture prediction, due to the 
aliasing components which are contained in the inter- 
laced picture signal. As mentioned above, even if there 
is only a parallel shift of the picture, a substantial 10 
amount of inter-picture prediction error will be produced. 
On the other hand, if all of the the interlaced pictures 
(i.e. each field of each frame) were to be converted to 
progressive scanning pictures (i.e. to respective pro- 
gressive scanning frames) and then encoded, the 15 
above problem could be avoided. However since the 
total number of scanning lines of every picture would be 
doubled, the amount of encoding and decoding 
processing which must be executed would be corre- 
spondingly doubled, so that practical apparatus would 20 
be difficult 

It is an objective of the present invention to provide 
a video encoding apparatus and video encoding 
method, and a video decoding apparatus and video 
decoding method, and an encoding recorded medium, 25 
whereby the above problems of the prior art can be 
overcome. 

A further problem which arises in the prior art is that 
when an interlaced video signal is obtained by repro- 
duction from a recorded medium, or from a transmission 30 
source, the reconstructed interlaced pictures are not 
suitable for display by a progressive scanning type of 
monitor, such as is generally used to display text, 
images, etc., in the field of computers and data process- 
ing. 35 

It is therefore a further objective of the present 
invention to provide a decoding apparatus, for operating 
on code which is generated by a video encoding appa- 
ratus according to the present invention as described 
above, whereby an output decoded video signal is 40 
obtained which is a progressive scanning video signal, 
and whereby each of the fields of the original interlaced 
video signal has been converted to a progressive scan- 
ning frame of the output video signal from the video 
decoding apparatus. 45 

SUMMARY OF THE INVENTION 

To overcome the problems of the prior art dis- 
cussed above, the present invention provides a video so 
encoding apparatus and method, and corresponding 
video decoding apparatus and method whereby an 
interlaced video signal can be encoded and subse- 
quently decoded as a series of code sequences 
expressing pictures which have been respectively 55 
encoded either by intra-picture encoding (i.e., I pic- 
tures), by unidirectional predictive encoding (i.e., P pic- 
tures) or by bidirectional predictive encoding (i.e. the B 
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pictures), which differs from the prior art in that: 

(a) The I, P and B pictures are selected in units of 
fields of the interlaced video signal, rather than 
frames, and 

(b) Each of the fields which is selected as an I or P 
picture, i.e. which is utilized as a reference picture 
in prdictive encoding, is converted to a progressive 
scanning frame, having twice the number of scan- 
ning lines of the fields of the original video signal. 

That is to say. improved prediction is achieved by 
periodically selecting specific fields of an interlaced 
video signal to be converted to respective progressive 
scanning frames, performing encoding and decoding of 
each such progressive scanning frame by independent 
encoding or by unidirectional predictive encoding, while 
leaving the remaining fields unchanged as interlaced 
scanning fields, and performing bidirectional predictive 
encoding and decoding of such interlaced scanning 
fields by using preceding and succeeding progressive 
scanning frames as reference frames. 

As a result, since the number of scanning lines of 
each field selected as a B picture for encoding is left 
unchanged, the conversion of the I and P pictures to 
progressive scanning frames does not result in an 
excessive amount of code being generated. However 
the encoding distance (i.e., between each frame or field 
which is encoded and the progressive scanning frames 
which are used as reference pictures for that encoding) 
can thereby be effectively halved, by comparison with 
prior art methods, a substantially improved combination 
of encoding efficiency and accuracy of motion predic- 
tion can be achieved than has been possible in the prior 
art. 

More specifically, according to a first aspect the 
invention provides a video encoding apparatus for 
encoding an interlaced video signal, comprising 

progressive scanning conversion means for con- 
verting one field of every m fields (where m is an 
integer of value 2 or greater) of said interlaced 
video signal to a single progressive scanning frame 
which has twice the scanning line density of an 
interlaced field, while leaving the remaining fields 
unchanged as interlaced scanning fields, 
first encoding means for encoding each said pro- 
gressive scanning frame either by independent 
internal encoding of the frame or by unidirectional 
predictive encoding based on progressive scanning 
frames which have been encoded, and 
second encoding means for performing predictive 
encoding of each remaining field of said video sig- 
nal other than said fields which are converted to 
progressive scanning frames, using, as reference 
frames for said predictive encoding, selected ones 
of said progressive scanning frames which precede 
and succeed said each remaining field along the 
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time axis. 

Such a video encoding apparatus comprises inter- 
picture prediction means for deriving prediction signal 
values based on said progressive scanning frames 
which have been encoded, and said second encoding 
means comprises decimation means for receiving from 
said inter-picture prediction means, in synchronism with 
execution of encoding processing for said each remain- 
ing field, a set of prediction signal values derived based 
on said preceding and succeeding progressive scan- 
ning frames, executing decimation processing of said 
set to remove all prediction signal values other than 
sub-sets of prediction signal values which respectively 
corresponding to scanning lines of said each remaining 
field, and using remaining ones of said prediction signal 
values of the set in said predictive encoding of said each 
remaining field. 

According to a second aspect, the invention pro- 
vides a video decoding apparatus for executing decod- 
ing of successive code sequences which have been 
generated by encoding of progressive scanning frames 
derived by doubling the scanning line density of one in 
every m fields of an original interlaced video signal 
(where m is an integer of 2 or greater), with the encod- 
ing process using intra-frame predictive encoding or 
unidirectional predictive encoding, and by encoding 
remaining fields of said video signal directly as inter- 
laced scanning fields, with the encoding process using 
bidrectional predictive encoding based on preceding 
and succeeding ones of said progressive scanning 
frames, the video decoding apparatus comprising: 

first decoding means for decoding each of said 
code sequences corresponding to said progressive 
scanning frames by intra-frame decoding of said 
code sequence or by unidirectional predictive 
decoding of said code sequence using decoded 
ones of said progressive scanning frames, 
second decoding means for performing bidirec- 
tional predictive decoding of respective code 
sequences corresponding to each of said fields 
which have been encoded as interlaced scanning 
fields, using as reference frames decoded ones of 
said progressive scanning frames which precede 
and succeed the field corresponding to said code 
sequence along the time axis, and 
picture reconfiguration means for performing deci- 
mation of scanning lines of each of the progressive 
scanning frames which have been decoded by the 
first decoding means, to obtain a converted field 
having the same number of scanning lines as an 
interlaced scanning field, and executing time-axis 
combination of respective ones of said converted 
fields with said fields which are derived by said sec- 
ond decoding means, in an appropriate sequence 
for reproducing said original interlaced video signal. 



Such a video decoding apparatus comprises inter- 
picture prediction means for deriving prediction signal 
values based on decoded ones of said progressive 
scanning frames, while said second encoding means 
5 comprises decimation means for: 

receiving from said inter-picture prediction means, 
in synchronism with execution of said decoding 
processing for a field which was encoded as an 

w interlaced scanning field, a set of prediction signal 
values derived based on said decoded preceding 
and succeeding progressive scanning frames, 
executing decimation processing of said set to 
remove all prediction signal values other than sub- 

75 sets of prediction signal values which respectively 
correspond to scanning lines of said field, and 
applying remaining ones of said prediction signal 
values of the set in said predictive decoding of the 
code sequence corresponding to said field. 

20 

According to a third aspect, the invention provides a 
video decoding apparatus for executing decoding of 
encoded progressive scanning frames derived by dou- 
bling the scanning line density of one in every m fields 

25 of an interlaced video signal (where m is an integer of 2 
or greater), with the encoding process using intra-frame 
encoding or unidirectional predictive encoding, and 
encoding of remaining fields of said video signal directly 
as interlaced scanning fields, the video decoding appa- 

30 ratus comprising 

decoding means for decoding each of said encoded 
progressive scanning frames by intra-frame decod- 
ing or by unidirectional predictive decoding using 
35 decoded ones of said progressive scanning frames, 
to thereby obtain a first series of reconstructed pic- 
tures, 

prediction error decoding means for decoding of 
prediction error values for each of said remaining 

40 fields which are encoded as interlaced scanning 
fields, interpolation means for performing over- 
sampling of said prediction error values in a vertical 
scanning direction of afield, to derive reconstructed 
prediction error values corresponding to scanning 

45 lines of progressive scanning frames, 

inter-picture prediction means for deriving predic- 
tion signal values for each of said fields which were 
encoded unchanged as interlaced scanning fields, 
using a preceding one and a succeeding one of 

so said first series of reconstructed pictures as refer- 
ence frames. 

adder means for adding to said prediction signal 
values said reconstructed prediction error values 
corresponding to progressive scanning lines, to 
55 thereby obtain a second series of reconstructed 
pictures, and picture reconfiguration means for 
inserting said second series of reconstructed pic- 
tures into said first series of reconstructed pictures 
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to obtain a reconstructed video signal in which all 
pictures are expressed as progressive scanning 
frames. 

According to a fourth aspect, the invention provides 
an encoded video recorded medium having recorded 
thereon code expressing interlaced video information 
which has been encoded by high-efficiency encoding of 
an interlaced video signal, said encoding being charac- 
terized in that: 

one in every m fields of said interlaced video signal 
(where m is an integer of 2 or greater) is converted 
to a progressive scanning frame having double the 
number of scanning lines while remaining fields are 
left unchanged as interlaced scanning fields, 
each of the progressive scanning frames is either 
independently internally encoded or is encoded by 
unidirectional prediction encoding using as refer- 
ence frames progressive scanning frames which 
have already already been encoded, to thereby 
obtain respective first code sequences correspond- 
ing to said progressive scanning frames, 
each of the remaining fields which are left 
unchanged as interlaced scanning fields is 
encoded by bidirectional prediction using preceding 
and succeeding ones of the encoded progressive 
scanning frames as reference frames, to thereby 
obtain a respective second code sequences corre- 
sponding to said remaining fields, and 
said first and second code sequences are recorded 
to constitute said encoded video recorded medium. 

According to a fifth aspect, the invention provides a 
method of encoding an interlaced video signal, compris- 
ing: 

converting one field of every m fields (where m is an 
integer of value 2 or greater) of said interlaced 
video signal to a single progressive scanning frame 
which has twice the scanning line density of an 
interlaced field, while leaving remaining fields of 
said video signal unchanged as interlaced scan- 
ning fields, encoding each said progressive scan- 
ning frame either by independent internal encoding 
of the frame or by unidirectional predictive encoding 
based on progressive scanning frames which have 
been encoded, and performing bidirectional predic- 
tive encoding of each the remaining fields of said 
video signal other than said fields which are con- 
verted to progressive scanning frames, using, as 
reference frames for deriving respective sets of pre- 
diction signal values corresponding to each of said 
remaining fields, selected ones of said progressive 
scanning frames which precede and succeed said 
each remaining field along the time axis. 

With such a method decimation processing is 



applied to each of said sets of prediction signal values 
to remove respective sub-sets of prediction signal val- 
ues which do not correspond to scanning lines of said 
each remaining field, to thereby obtain a set of predic- 

5 tion signal values for use in predictive encoding of said 
each remaining field. 

According to a sixth aspect, the invention provides 
a method of decoding successive code sequences 
which have been generated by a first encoding process 

w applied to respective progressive scanning frames 
which have been derived by doubling the scanning line 
density of one in every m fields of an original interlaced 
video signal (where m is an integer of 2 or greater), with 
the first encoding process using intra-frame predictive 

75 encoding or unidirectional predictive encoding, and by 
encoding remaining fields of said video signal directly 
as interlaced scanning fields in a second encoding 
process, with the second encoding process using bidi- 
rectional predictive encoding based on preceding and 

20 succeeding ones of said progressive scanning frames, 
with the video decoding method comprising: 

executing decoding each of respective code 
sequences corresponding to each of said progres- 

25 sive scanning frames by intra-frame decoding of 
said code sequence or by unidirectional predictive 
decoding of said code sequence using decoded 
ones of said progressive scanning frames, execut- 
ing predictive decoding of respective code 

30 sequences corresponding to each of said fields 
which have been encoded as interlaced scanning 
fields, using as reference frames decoded ones of 
said progressive scanning frames which precede 
and succeed the field corresponding to said code 

35 sequence along the time axis, performing decima- 
tion of scanning lines of each of the progressive 
scanning frames which have been decoded by the 
first decoding means, to obtain a converted field 
having the same number of scanning lines as an 

40 interlaced scanning field, and combining respective 
ones of said converted fields and said fields which 
are derived by said second decoding means, along 
the time axis, in an appropriate sequence for repro- 
ducing said original interlaced video signal. 

45 

With such a method, the predictive decoding of a 
code sequence corresponding to a field which has been 
encoded as interlaced scanning fields is performed by 
deriving a set of prediction signal values based on 

so decoded progressive scanning frames which precede 
and succeed said field along the time axis, executing 
decimation processing of said set to remove all predic- 
tion signal values other than sub-sets of prediction sig- 
nal values which respectively correspond to scanning 

55 lines of said field, and applying remaining ones of said 
set of prediction signal values in said predictive decod- 
ing of the code sequence corresponding to said field. 
According to a seventh aspect, the invention pro- 
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vides a method of decoding successive code 
sequences which have been generated by encoding 
respective progressive scanning frames derived by dou- 
bling the scanning line density of one in every m fields 
of an original interlaced video signal (where m is an 5 
integer of 2 or greater), with the encoding process using 
intra-frame encoding or unidirectional predictive encod- 
ing, and by encoding remaining fields of said video sig- 
nal directly as interlaced scanning fields, 

10 

the decoding method comprising: 

decoding each of said progressive scanning 
frames by intra-frame decoding or by unidirec- 
tional predictive decoding using previously 15 
decoded ones of said progressive scanning 
frames, to thereby obtain a first series of recon- 
structed pictures, 

decoding respective sets of prediction error val- 
ues for each of said remaining fields which are 20 
encoded as interlaced scanning fields, 
performing over-sampling of said prediction 
error values in a vertical scanning direction of a 
field, to generate reconstructed prediction error 
values corresponding to scanning lines of a 25 
progressive scanning frame, 
deriving, for said each field which was encoded 
as an interlaced scanning field, prediction sig- 
nal values based on selected ones of said first 
series of progressive scanning frames which 30 
precede and succeed said each field along the 
time axis, as reference frames, 
adding said prediction signal values to said 
reconstructed prediction error values, to 
thereby obtain a second series of recon- 35 
structed pictures, and inserting said second 
series of reconstructed pictures into said first 
series of reconstructed pictures to obtain a 
reconstructed video signal in which all pictures 
are expressed as progressive scanning 40 
frames. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a general system block diagram of an 
embodiment of a video encoding apparatus accord- 
ing to the present invention; 
Fig. 2 is a conceptual diagram for illustrating the 
relationships between the scanning line configura- 
tions of successive I. P progressive scanning 
frames and B fields which are encoded by the 
embodiment of Fig. 1 1 ; 

Fig. 3 is a general system block diagram of a first 
embodiment of a video decoding apparatus accord- 
ing to the present invention, for use in conjunction 
with the video encoding apparatus of Fig. 1 ; 
Fig. 4 is a conceptual diagram for illustrating the rel- 
ative amounts of encoding processing time periods 
which are available for the I, P and B pictures 
respectively, if the video encoding apparatus 
embodiment of Fig. 1 were to be modified to use a 
single encoding system in common for encoding 
the I, P and B pictures by time-sharing operation; 
Fig. 5 is a general system block diagram of an 
example of a prior art type of video encoding appa- 
ratus; 

Fig. 6 is a general system block diagram of an 
example of a prior art type of video decoding appa- 
ratus for use in conjunction with the video encoding 
apparatus of Fig. 5; 

Fig. 7 is a general system block diagram of a sec- 
ond embodiment of a video decoding apparatus 
according to the present invention, for use in con- 
junction with the video encoding apparatus of Fig. 
1, whereby an encoded field-interlaced type of 
video signal is converted by the encoding/decoding 
process to a progressive scanning video signal; 
and 

Fig. 8 is a conceptual diagram for illustrating the 
relationships between the scanning line configura- 
tions of successive I, P and B progressive scanning 
frames which are generated by the video decoding 
apparatus embodiment of Fig. 7. 



As a result, the present invention can provide a 
video encoding method and apparatus and video 
decoding method and apparatus whereby an interlaced 
video signal can be encoded with a very high level of 
encoding efficiency, and furthermore whereby the 
resultant code, after having been transmitted and 
received, or recorded and reconstructed, can be 
decoded to recover the original video signal as an inter- 
laced signal, or as a progressive scanning video signal 
in which each field of the originally encoded video signal 
has been converted to a progressive scanning frame 
(having double the number of scanning lines of an inter- 
laced field) which can be directly displayed by various 
types of data processing display apparatus, etc.. that 
can only utilize a progressive scanning video signal. 



DESCRIPTION OF PREFERRED EMBODIMENTS 

45 An embodiment of a motion-compensation encod- 
ing apparatus according to the present invention will be 
described in the following referring to the system block 
diagram of Fig. 1 . In Fig. 1 , elements that are identical in 
function and operation to elements of the prior art video 

so encoding apparatus example of Rg. 5 are designated 
by identical numerals to those of Fig. 5. Basically, the 
apparatus of Rg. 1 differs from that of Fig. 5 by including 
a progressive scanning conversion section 1 and scan- 
ning line decimation section 15. Also, the frame delay 

55 element 61 of the prior art apparatus of Fig. 5 is 
changed to a field delay element 16, and the operation 
of the input signal selection switch 8 and of the inter-pic- 
ture prediction section 9 respectively differ from those of 
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the input signal selection switch 56 and inter-picture 
prediction section 57 of the prior art video encoding 
apparatus example of Fig. 5. 

The processing executed with this embodiment will 
be described by comparison with that of the prior art s 
video encoding apparatus example of Fig. 5. With this 
embodiment, the picture processing units are respec- 
tive fields of the input interlaced video signal which is 
supplied to the video signal input terminal 7, and the 
picture type (i.e., I, P, B) is also established in units of w 
these interlaced fields. Thus, the time-axis separation 
between successive I, P or B pictures is the field period, 
e.g., 1/60 second. 

The major features of this embodiment are as fol- 
lows. The fields of the input video signal which are to be is 
processed as I and P type pictures are each subjected 
to conversion processing to increase the scanning line 
density, i.e. interpolation of scanning lines is performed 
to double the number of scanning lines per field and 
thereby achieve conversion to progressive scanning 20 
frames. The fields of the input video signal which are to 
be processed as B type pictures are left unchanged, i.e. 
are encoded in units of fields, in a similar manner to that 
described for the B type fields of the prior art example of 
Fig. 5. 25 

Fig. 2 shows the arrangement of scanning lines of 
the resultant picture types. With a standard resolution 
television system, the effective number of scanning 
lines is 460 lines per frame, 240 lines per field. Since 
each of the I and P type pictures is converted to have so 
twice the number of scanning lines of the original fields, 
the amount of encoding processing which must be exe- 
cuted for each of these is accordingly doubled. 

Encoding of the I and P progressive scanning 
frames is not executed as for the prior art example, i.e. 35 
based on the two fields/frame configuration. Instead, as 
can be clearly understood from the scanning line config- 
uration illustrated in Fig. 2, prediction is executed for 
each P type progressive scanning frame based on a 
preceding I type or P type progressive scanning frame, 40 
and is executed for each B type field based on preced- 
ing and succeeding ones of the I type or P type progres- 
sive scanning frames. It can thus be seen that the 
encoding processing can be simplified by comparison 
with the prior art method of the example of Fig. 5. 45 

In the case of the prediction processing which is 
applied to each interlaced scanning field which is to be 
encoded as a B field, the operation is as follows. A set 
of prediction signal values is generated by the inter-pic- 
ture prediction section 9, (in synchronism with input to so 
the subtracter 1 7 of the pixel values for the field which is 
to be encoded) based upon specific I or P progressive 
scanning frames which succeed and precede that inter- 
laced field. Since that set has been derived using pro- 
gressive scanning frames, it contains twice the number ss 
of prediction signal values that are required for encoding 
an interlaced field, i.e. the set includes respective sub- 
sets of values which correspond to scanning lines that 



are omitted from the interlaced field. For that reason, 
each such set of prediction signal values is subjected to 
decimation processing by the scanning line decimation 
section 15, to eliminate each of these sub-sets of pre- 
diction signal values corresponding to respective scan- 
ning lines which do not appear in the field which is being 
encoded. 

It will be apparent that this operation of the scan- 
ning line decimation section 15 must be controlled in 
accordance with whether the field which is being 
encoded contains the even-numbered or odd-numbered 
scanning lines of an interlaced scanning frame, so that 
appropriate sub-sets of prediction signal values will be 
eliminated. TTiat is to say the scanning line decimation 
section 15 must extract, from a set of prediction signal 
values supplied from the inter-picture prediction section 
9, those values which match the scanning lines of the 
field which is to be encoded. However since it is inher- 
ently necessary for various types of prior art apparatus 
which operate on an interlaced video signal to include 
means for discriminating between the even and odd 
number fields (i.e. the first and second fields of a frame) 
based on the synchronizing signals contained in an 
interlaced video signal and to control the operation of 
the apparatus in accordance with whether an even or 
odd number field is being processed, so that the tech- 
nology is extremely well-known, detailed description of 
such discrimination and control operation will be omit- 
ted. 

Referring to Fig. 2, for example, assuming that the I 
and P progressive scanning frames designated by 
numerals 200, 203 are used to derive prediction signal 
values for encoding the interlaced scanning field 201 as 
a B field, the first scanning lines of the fields 201, 202 
being indicated as 201a, 202a, then the result of the 
operation of the inter-picture prediction section 9 and 
scanning line decimation section 15 will be as follows. 
Of a complete set of prediction signal values which are 
generated by the inter-picture prediction section 9 (i.e., 
a set which would be appropriate for encoding a pro- 
gressive scanning frame) allsub-sets which correspond 
to scanning lines that do not occur in the field 201 are 
eliminated by the scanning line decimation section 15. 

It will be apparent that with such a method, the the 
problems which arise in the prior art when performing 
such encoding due to the 1-line vertical displacement 
and 1 -field period time-axis displacement between the 
two interlaced fields of an I frame or a P frame, can be 
eliminated. 

The operation of the input selection switch 8 is syn- 
chronized with successive fields of the input video sig- 
nal. As mentioned above, the technology relating to 
such control is very well known, so that detailed descrip- 
tion of specific switch control apparatus is omitted. 

Typically, values for m in the range 3 to 6 are appro- 
priate, i.e., values which are larger than those used in 
the prior art, and the proportion of I frames in the overall 
stream of I, P, B pictures can be made accordingly 
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greater than is possible with prior art methods, without 
making the prediction distance excessively large, so 
that the amount of encoded data can be substantially 
reduced by comparison with the prior art. Conversely, if 
for example the same value m were to used as for the 
prior art example, then the prediction distance would be 
halved, by comparison with the prior art example, so 
that an according increase in motion prediction accu- 
racy would be obtained. 

The progressive scanning conversion section 1 can 
be configured as shown in Japanese Patent Laid-open 
No. HEI 8-130716, whereby motion prediction is per- 
formed in units of small blocks, using preceding and 
succeeding fields, with interpolation of scanning lines 
which have been omitted from a field due to the inter- 
laced scanning. Since the contents (i.e., pixel values) of 
preceding and succeeding interlaced fields are required 
for this interpolation operation, it will be understood that 
the progressive scanning conversion section 1 includes 
any necessary delay elements for achieving this, such 
as a field memory. 

The operation of each of the subtracter 2, the DCT 
section 3, the quantizer 4, and the variable-length 
encoder 5 is basically identical to that of the corre- 
sponding element of the prior art example. However 
since there is 1 progressive scanning frame in each field 
period of the original video signal, e.g., 1/60 second, it 
is necessary to execute processing at twice the speed 
of the prior art example, if real-time processing is to be 
achieved. 

The same is also true for the operation of the 
dequantizer 6, the inverse DCT section 12 and the 
adder 1 1 which constitute the local decoding section. 
The operating speed of the picture memory 10 is the 
same as that of the inverse DCT section 12, however 
the memory capacity is the same as that of the prior art 
example. That is to say, with the prior art example there 
are two interlaced fields per I frame or P frame whereas 
with the present invention each I frame or P frame is a 
single progressive scanning frame having twice the 
number of scanning lines of an interlaced field. 

With the present invention, since the inter-picture 
prediction section 9 can execute processing simply in 
units of progressive scanning frames, the operation can 
be simpler than for the prior art example. Specifically, 
motion estimation is performed within the inter-picture 
prediction section 9 by operating on blocks of 16 x 16 
pixels or 8 x 8 pixels, and motion compensation is exe- 
cuted in accordance with the detected motion vectors. 
In general, the accuracy of motion compensation is to 
within 1/2 of a pixel. 

In the case of the B fields, a delay of (m - 1) fields 
must be applied to the picture signal by the field delay 
element 16. Since the picture types (i.e., f, R B) are 
established in units of interlaced scanning fields of the 
input video signal as described above, the delay applied 
by the field delay element 16 is set in units of field peri- 
ods. However in addition, for the the progressive scan- 



ning conversion section 1 to perform inter-field 
interpolation for generating the I and P progressive 
scanning frames, a delay of one field must occur 
between inputting video signal values for a field to the 

5 progressive scanning conversion section 1 and output 
of resultant video signal values for a progressive scan- 
ning frame. Thus, it is necessary to apply a correspond- 
ing amount of delay to the B fields to compensate for 
this. As a result, the total delay applied by the field delay 

w element 16 must be m field periods (e.g., m/60 sec- 
onds). 

Trie operation of each of the subtracter 17, DCT 
section 18, quantizer 19 and variable-length encoder 20 
is basically identical to that of the prior art example, but 

75 with processing being executed in units of fields. That is 
to say, as described above, the prediction signal which 
is supplied to the subtracter 17 to derive difference val- 
ues (i.e., values of prediction error) for a B field is 
derived from preceding and succeeding I or P progres- 

20 sive scanning frames which are separated from that B 
field by one or more field periods. 

It will be understood that such a form of operation is 
basically different from the prior art in that, since the pic- 
ture types (i.e., I, P, B) are established in units of fields, 

25 isolated I pictures and P pictures will occur as shown in 
Fig. 2. However with a prior are method such as that of 
the apparatus of Fig. 5, pairs of I pictures (expressed as 
interlaced fields) will occur consecutively, as will pairs of 
P pictures. 

30 In Fig. 1, the processing system that is constituted 
by the subtracter 2, the DCT section 3, the quantizer 4 
and the variable-length encoder 5 is configured sepa- 
rately from the processing system that is constituted by 
the subtracter 1 7, DCT section 1 8, quantizer 1 9 and var- 

35 iable-length encoder 20. However, since the processing 
executed by these two systems is basically the same, 
these could be combined into a single system, through 
use of time-sharing operation. 

In that case, since the amount of processing 

40 required for an I frame or P frame is double the amount 
of processing required for a B field, the relationships 
between the processing timings for encoding the I, P 
and B fields could be as shown conceptually in the 
example designated as "Processing a", in Fig. 4. As 

45 shown, since the processing time intervals required for 
each of the I and P frames is twice that required for a B 
field, it is necessary to make the processing time for a B 
field shorter than a field period (e.g., shorter than 1/60 
second). With the example shown, assuming a value of 

so 3 for m, the encoding processing for each B field must 
be completed within 0.75 of a field period, while the 
encoding processing for each I or P field must be com- 
pleted within 15 field periods. 

On the other hand, as shown in Japanese Patent 

55 Laid-open No. HEI 6-311505, subsampling of the pre- 
diction error values obtained for the B fields can be 
applied, to halve the amount of processing performed 
by the stages which succeed the DCT section 1 8. In that 
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case, again assuming that a value of 3 is used for m, 
then as indicated by the "Processing b" example in Fig. 
4, real-time encoding processing of the I and P fields 
becomes possible, i.e., two field periods are available 
for executing such processing, in the same way as for 5 
the configuration of Fig. 1 . 

As a specific configuration, a subsampler operating 
along the horizontal direction of each B field can be 
inserted between the subtracter and the DCT section of 
such a combined configuration (i.e. which respectively 10 
perform the functions of the subtracters 2, 1 7 and the 
DCT sections 3, 18 in the embodiment of Fig. 2). 

Fig. 3 shows an embodiment of a video decoding 
apparatus corresponding to the video encoding appara- 
tus of Fig. 1 for decoding the output code generated by 15 
that video encoding apparatus, e.g., when transmitted 
code is received or recorded code is reconstructed. 

In Fig. 3, elements that are identical to elements of 
the prior art example of Fig. 6 are designated by identi- 
cal numerals. The configuration of Fig. 3 differs from 20 
that of the prior art example of Fig. 6 by including scan- 
ning line decimation sections 81 and 36. Also, the oper- 
ation of the inter-picture prediction section 35 differs 
from that of the prior art example of Fig. 6, while in addi- 
tion the processing executed by each of the variable- 25 
length decoder 31, the dequantizer 6, the inverse DCT 
section 12 and the adder 1 1 with respect to the I and P 
pictures is in units of progressive scanning frames. 

The code which is input from the code input termi- 
nal 33 is separated by the demultiplexer 34 into the I 30 
and P frame code sequences and the B frame code 
sequences. The I and P frame code sequences are sub- 
jected to decoding processing by the variable-length 
decoder 31 , the dequantizer 6, the inverse DCT section 
1 2 and the adder 1 1 , in the same way as for the prior art 35 
example of Fig. 6, to thereby obtain sets of pixel values 
for reconstructed pictures expressed as respective pro- 
gressive scanning frames, which are then stored in a 
picture memory 32. 

To achieve complete real-time processing, it is nec- 40 
essary that processing of each progressive scanning 
frame be completed within one field period, e.g., 1/60 
second. 

The inter-picture prediction section 35 generates a 
prediction signal (expressing successive reconstructed 45 
prediction error values) based on the decoded progres- 
sive scanning frames stored in the memory 32, supplies 
this to the adder 11 for deriving pixel values for recon- 
structed I and P progressive scanning frames, and sup- 
plies a prediction signal to scanning line decimation so 
section 81 for use in decoding the B fields. The inter-pic- 
ture prediction section 35 operates in a basically similar 
manner to the inter-picture prediction section 9 of Fig. 1 . 
However the inter-picture prediction section 35 differs 
from the inter-picture prediction section 9 in that it does ss 
not perform motion estimation or prediction mode selec- 
tion. Thus the amount of processing performed by the 
inter-picture prediction section 35 is substantially less 
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than that executed by the inter-picture prediction section 

9. 

The B-picture code sequences supplied from the 
demultiplexer 34 (which correspond to respective inter- 
laced-scanning fields, as described above referring to 
Fig. 1) are decoded by the variable-length decoder 38, 
the dequantizer 39, the inverse DCT section 40 and by 
addition of prediction signal values in the adder 41, to 
obtain respective reconstructed B field signals (i.e., suc- 
cessive reconstructed pixel values for the B fields). 

The operation of the scanning line decimation sec- 
tion 81 of this embodiment in selecting appropriate sets 
of prediction signal values from those generated by the 
inter-picture prediction section 35, in relation to the 
scanning lines of the B fields which are to be decoded, 
is identical to that of the scanning line decimation sec- 
tion 15 of Fig. 1, which has been described in detail 
hereinabove. As a result of supplying these prediction 
signal values to the adder 41 in conjunction with 
decoded prediction error values for each B field which 
are output from the inverse DCT section 40, respective 
sets of reconstructed pixel values expressing these B 
fields are obtained, i.e. the corresponding interlaced 
fields of the original video signal are thereby obtained 
from the adder 41 , and supplied to the output selection 
switch 42. 

In addition, sets of reconstructed pixel values corre- 
sponding to respective I, P progressive scanning frames 
are read out from the picture memory 32, via the output 
lines designated as 32b in Fig. 3. Each of these sets is 
supplied to the scanning line decimation section 36, 
which eliminates specific sub-sets of pixel values which 
correspond to scanning lines that must be omitted in 
order to convert a progressive scanning frame to an 
appropriate (i.e. odd-numbered or even-numbered) 
interlaced scanning field. The values expressing that 
field are then supplied to the output selection switch 42. 

The output selection switch 42 is controlled, in con- 
junction with control of read out of values from the pic- 
ture memory 32, to transfer the sets of values for 
reconstructed fields which have been derived from the I 
and P progressive scanning frames and the sets of val- 
ues for the reconstructed B fields to the video output ter- 
minal 43 in the same order of interlaced scanning fields 
as that of the original video signal prior to encoding. A 
reconstructed output interlaced video signal is thereby 
obtained, with the fields having been restored to the 
time-axis order of the original video signal, rather than 
that of the code sequences representing respective I, P 
and B pictures which are output from the video encod- 
ing apparatus of Fig. 1. 

It should be noted that the scanning line decimation 
sections 81 and 36 perform processing which differs 
from subsampling processing in that no filter processing 
is executed prior to the decimation operation. This is 
because the reconstructed progressive scanning pic- 
tures are derived from original interlaced pictures which 
have been converted to progressive scanning form, so 
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that the vertical scanning frequency characteristic is 
already limited to a degree that is appropriate for an 
interlaced signal, prior to the decimation processing. 

As can be understood from the above description, 
the video decoding apparatus embodiment of Fig. 3, in 5 
combination with the video encoding apparatus embod- 
iment of Fig. 1, enables a system for transmitting or 
recording an interlaced video signal as encoded data 
with a very high encoding efficiency. This results from 
the following reasons. Since the I, P and B pictures w 
respectively corresponding to fields (of the original inter- 
laced video signal), rather than units of 2-field frames, if 
the relative proportions of I, P and B pictures were to be 
made identical to that of used with a prior art system 
such as that of Figs. 5, 6, the encoding distance (e.g., 15 
as represented by the time-axis separation between 
pictures selected as I pictures) would be halved, and the 
motion prediction accuracy accordingly doubled. Alter- 
natively stated, with the present invention it would be 
possible to double the proportion of B pictures in rela- 20 
tion to the I and P pictures, while maintaining the same 
level of prediction accuracy. Since the amount of code 
that is generated for a B picture is less than that for a P 
picture, and very much less than that for an I picture, 
enhanced encoding efficiency can be achieved. 25 

In a similar manner to that described hereinabove 
for the video encoding apparatus of Fig. 1, H would be 
possible to modify the video decoding apparatus of Fig. 
3 to perform processing of both the I and P progressive 
scanning frames and also the B fields by a single sys- 30 
tern, through appropriate time-sharing operation. In that 
case, the apparatus could be configured to perform sub- 
sampling of the reconstructed prediction error values for 
the B fields, by inserting a subsampler operating along 
the horizontal direction of each field, i.e. with the sub- 35 
sampler being inserted (during each interval in which a 
B field is being decoded) between the inverse DOT sec- 
tion that would be used in common for decoding I, P and 
B pictures and the adder which receives the output val- 
ues from that inverse DCT section. In that way it would 40 
become possible to increase the time available for 
decoding each of the I and P progressive scanning 
frames, in the same manner as described above refer- 
ring to Fig. 4 for the case of encoding processing. 

Fig. 7 shows a second embodiment of a decoding 45 
apparatus for operating on the encoded video data gen- 
erated by the encoding apparatus of Fig. 1. In Fig. 7, 
elements that are identical to elements of the prior art 
example of Fig. 6 are designated by identical numerals. 
The configuration of Fig. 7 differs from that of the prior so 
art example of Fig. 6 by including a scanning line inter- 
polation section 44, while in addition the operation of the 
inter-picture prediction section 35 differs from that of the 
prior art example of Fig. 6, and furthermore the 
processing executed by each of the variable-length 55 
decoder 31 , the dequantizer 6, the inverse DCT section 
12 and the adder 1 1 with respect to the I and P pictures 
is in units of progressive scanning frames, in the same 



way as for the video decoding apparatus embodiment of 
Fig. 3 described above. 

The code which is input from the code input termi- 
nal 33 is separated by the demultiplexer 34 into the I 
and P frame code sequences, i.e., for respective pro- 
gressive scanning frames, and the B field code 
sequences, for respective interlaced scanning fields. 
The I and P frame code sequences are subjected to 
decoding processing by the variable-length decoder 31 , 
the dequantizer 6, the inverse DCT section 12 and the 
adder 11, in the same way as for the prior art example 
of Fig. 6, to thereby obtain sets of reconstructed pixel 
values expressing respective I and P progressive scan- 
ning frames, which are stored in the picture memory 32. 

To achieve complete real-time processing, it is nec- 
essary that the decoding processing of each progres- 
sive scanning frame be completed in one field period 
(e.g., 1/60 second). 

The inter-picture prediction section 35 generates a 
prediction signal in the same way as described for the 
embodiment of Fig. 3, based on the progressive scan- 
ning frames stored in the memory 32, and supplies this 
to the adder 1 1 for deriving pixel values for the recon- 
structed I and P progressive scanning frames. In addi- 
tion, the inter-picture prediction section 35 supplies a 
prediction signal to the adder 41, for deriving pixel val- 
ues for reconstructed progressive scanning frames 
which are obtained from decoded B fields as described 
in the following. 

The B frame code sequences are decoded by the 
variable-length decoder 38, the dequantizer 39 and the 
inverse DCT section 40 to obtain respective sets of 
reconstructed prediction error values for the B fields, 
i.e., this processing is performed in units of interlaced 
scanning fields. Each set of reconstructed prediction 
error values thereby obtained for a field is then sub- 
jected to scanning line interpolation (oversampling) in 
the vertical direction by the scanning line interpolation 
section 44, to obtain a corresponding set of recon- 
structed prediction error values for a progressive scan- 
ning frame. That is to say, the set of prediction error 
values corresponding to a 240 line field is converted to 
a set of prediction error values corresponding to a 480 
line progressive scanning frame, by generation and 
insertion of 240 sub-sets of interpolated prediction error 
values which correspond to respective interpolated 
scanning lines, before being supplied to the adder 41. 

This processing is executed within each field by 
oversampling, and is a simple form of processing by 
comparison with that executed by the progressive scan- 
ning conversion section 1 of the video encoding appara- 
tus of Fig. 1 . 

The prediction signal values which are output from 
the inter-picture prediction section 35 are added, in the 
adder 41 , to the prediction error values which have been 
converted to progressive scanning frame format by the 
scanning line interpolation section 44, to thereby obtain 
sets of reconstructed pixel values for respective pro- 
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gressive scanning frames that have been derived from 
decoded B fields. 

The output selection switch 42 receives the pictures 
expressed as respective reconstructed I and P progres- 
sive scanning frames which are read out from the pic- s 
ture memory 32 at appropriate timings (as described for 
the embodiment of Fig. 3 and the pictures expressed as 
respective progressive scanning frames derived from 
the B fields which are output from the adder 41 , and out- 
puts these progressive scanning frames to the video 
output terminal 43 in the appropriate order for producing 
a progressive scanning video signal in which the 
sequence of the progressive scanning frames is identi- 
cal to that of the originally encoded interlaced video sig- 
nal and in which the frame period is half of the frame 
period of that original video signal. That is to say, in the 
same way as described for the embodiment of Fig. 3, 
the order in which data expressing respective recon- 
structed pictures are supplied to the video output termi- 
nal 43 is identical to the time-axis order of the 
respectively corresponding pictures (expressed by 
interlaced fields} of the original video signal, rather than 
that of the I, P, B code sequences which are received by 
the video decoding apparatus. 

Fig. 8 is a diagram for conceptually illustrating the 
successive progressive scanning frames which are 
thereby supplied to the video output terminal 43. 
Respectively different symbols are used in Fig. 8 to indi- 
cate scanning lines which are derived in each of three 
different ways, i.e. scanning lines (indicated by circles) 
which are reconstructed from original scanning lines of 
the interlaced fields of the video signal, scanning lines 
(indicated by diamonds) which are inserted by the oper- 
ation of the progressive scanning conversion section 1 
of the video encoding apparatus of Fig. 1, and scanning 
lines (indicated by stars) which are inserted as a result 
of the operation of the scanning line interpolation sec- 
tion 44 in conjunction with the adder 41 in the video 
decoding apparatus of Fig. 7. 

It can be understood from the above description 
that with the video decoding apparatus embodiment of 
Fig. 7, by effectively using the inter-picture prediction 
processing and introducing oversampling processing of 
the reconstructed prediction error values which are 
derived for the B fields, it becomes possible to achieve 
(for the B pictures, of the I, P, B picture types) the same 
type of progressive scanning conversion processing as 
that performed on the I and P pictures by the progres- 
sive scanning conversion section 1 of the embodiment 
of Fig. 1 . However with the embodiment of Fig. 7 it is 
made unnecessary to convert each of the B fields to 
progressive scanning frames prior to encoding and 
transmission. Thus, the video decoding apparatus 
embodiment of Fig. 7 in conjunction with the video 
encoding apparatus of Fig. 1 enables an extremely 
practical and efficient system to be implemented 
whereby an interlaced video signal can be subjected to 
high-efficiency encoding and the resultant code trans- 



mitted or recorded, and whereby the resultant received 
or reconstructed code can be decoded to obtain a pro- 
gressive scanning video signal. 

A further aspect of the present invention is that it 
enables a digital encoded video recorded medium hav- 
ing a high efficiency of encoding to be realized. Specifi- 
cally, this can be achieved by multiplexing the picture 
code sequences which are generated by the video 
encoding apparatus of Fig. 1 with code sequences for 
audio and control information using the MPEG system 
standards, adding error correction codes to the result- 
ant code, modulating a recording signal with the result- 
ant code and then recording the modulated recording 
signal on a recorded medium. 

That is to say, this aspect of the invention can pro- 
vide an encoded video recorded medium implemented 
as a recorded medium having recorded thereon inter- 
laced video information which has been subjected to 
high-efficiency encoding. This is achieved by recording 
on the recorded medium respective code sequences 
each of which is derived from a corresponding field of a 
field interlaced digital video signal, with the code 
sequences consisting of first and second code 
sequences, where each of the first code sequences is 
generated by converting one out of every m fields of the 
interlaced video signal into a progressive scanning 
frame having double the number of scanning lines of an 
interlaced field (where m is an integer of 2 or more), and 
performing encoding of each such progressive scan- 
ning frame either by intra-frame encoding or by unidirec- 
tional inter-frame predictive encoding using a 
progressive scanning frame as a reference frame, and 
where each of the second code sequences is generated 
by bidirectional inter-frame predictive encoding of one of 
the remaining fields of the video signal (i.e., which has 
been left unchanged as an interlaced scanning field) 
using preceding and succeeding ones of the progres- 
sive scanning frames as reference fields. 

The recording can be executed at high speed, in the 
case of a read-only type of recorded medium, by using 
a stamper, etc. Reproduced code which is read from 
such an encoded recorded medium can be decoded to 
obtain either the original interlaced video signal, by 
using the first video decoding apparatus embodiment of 
the invention, or to obtain a progressive scanning video 
signal, by using the second video decoding apparatus 
embodiment as described above. 

As can be understood from the above description of 
embodiments, with the present invention, one in m fields 
of a digital interlaced video signal (where m is an integer 
of 2 or greater) is converted to a progressive scanning 
frame and is encoded and decoded either independ- 
ently within the frame or by unidirectional prediction, 
while other fields which are left unchanged as interlaced 
fields are encoded and decoded by bidirectional predic- 
tion using preceding and succeeding progressive scan- 
ning frames. That is to say, all of the reference pictures 
which are utilized for inter-picture prediction consist of 
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progressive scanning frames. As a result, the time-axis 
deviations which occur when interlaced fields (i.e. with 
each field having a vertical displacement of one scan- 
ning line from the preceding field) are used as reference 
pictures for the purpose of inter-frame predictive encod- 
ing can be eliminated, and the criteria for correctly sam- 
pling respective scanning lines of a picture can be 
satisfied, so that no aliasing components are generated 
in the resultant code. Thus, enhanced code compres- 
sion can be obtained, since the encoding of aliasing 
components as a result of inter-picture encoding is elim- 
inated. 

Another advantage is as follows. In the prior art, 
when intra-frame encoding of an I frame is performed, 
the encoding must utilize two successive fields, which 
are mutually displaced by one field period along the 
time axis and also by one scanning fine along the verti- 
cal picture direction. As a result, aliasing components 
are generated by such intra-frame encoding. However 
with the present invention as described hereinabove, 
each I picture to which intra-frame encoding is applied 
consists of a single progressive scanning frame. Hence, 
the problem of generation of aliasing components is 
again eliminated by the method of the present invention. 

Moreover, since the reference pictures for inter- 
frame encoding always consist of progressive scanning 
frames, i.e., with the scanning line density being twice 
that of a field, the accuracy of motion estimation in the 
vertical direction of a picture is accordingly doubled, so 
that motion compensation is accordingly more accurate 
than is possible with the prior art method. 

For example, if the value of m (i.e., for selecting one 
in every m successive fields or frames as described 
hereinabove) were to be made the same as that used 
with the prior art example of video encoding apparatus, 
then since with the present invention the picture types 
(i.e., I, R B) are established in units of fields, the inter- 
picture prediction distance would be halved, in the case 
of a video encoding apparatus according to the present 
invention. Hence, the prediction error values would be 
accordingly made smaller. If on the other hand the inter- 
picture prediction distance were to be made the same 
as that of the prior art video encoding apparatus exam- 
ple, then by comparison with the prior art video encod- 
ing apparatus example, the value of m could be doubled 
and hence the proportion of B frames could be made 
accordingly greater, with the apparatus of the present 
invention. Since the B frames are encoded by bidirec- 
tional prediction, and so require a smaller amount of 
code than the P and I frames, a correspondingly greater 
degree of code compression can be achieved. 

Moreover according to another aspect of the inven- 
tion, in spite of the fact the conversion processing for 
converting interlaced scanning fields to progressive 
scanning frames is performed on only one in m of the 
total number of fields, it is possible with the present 
invention (as described hereinabove referring to the 
video decoding apparatus embodiment of Fig. 7) to con- 



figure the decoding system such that although a field- 
interlaced video signal is encoded, the finally decoded 
video signal is a progressive scanning video signal in 
which each frame corresponds to a field of the original 

5 interlaced video signal, so that the frame frequency is 
twice that of the original video signal. In that way, the 
decoded video signal can be in a form which can be 
directly supplied to various types of display apparatus 
such as computer monitors, liquid crystal displays, PDP 

w displays, etc., i.e., to those types of apparatus which 
can display only progressive scanning pictures, and 
high display quality can be achieved. 

An encoding apparatus includes a selector (8) for 
periodically selecting fields of an interlaced video signal 

75 to be converted to respective progressive scanning 
frames, by a scanning converter (1) which doubles the 
number of scanning lines per field. The apparatus 
encodes these frames by intra-frame encoding or unidi- 
rectional predictive encoding using preceding ones of 

20 the frames, and encodes the remaining fields of the 
video signal by bidirectional prediction using preceding 
and succeeding ones of the progressive scanning 
frames for reference. The resultant code can be 
decoded by an inverse process to recover the interlaced 

25 video signal, or each decoded field can be converted to 
a progressive scanning frame to thereby enable output 
of a progressive scanning video signal. Enhanced accu- 
racy of motion prediction for inter-frame encoding can 
thereby be achieved, and generation of encoded alias- 

30 ing components suppressed, since all reference pic- 
tures are progressive scanning frames rather than pairs 
of fields constituting interlaced scanning frames. 

Claims 

35 

1 . A video encoding apparatus for encoding an inter- 
laced video signal, comprising 

progressive scanning conversion means (1) for 
40 converting one field of every m fields (where m 

is an integer of value 2 or greater) of said inter- 
laced video signal to a single progressive scan- 
ning frame which has twice the scanning line 
density of an interlaced field, while leaving the 
45 remaining fields unchanged as interlaced 

scanning fields, 

first encoding means (2, 3, 4 f 5, 6, 9, 10, 11, 
12) for encoding each said progressive scan- 
ning frame either by independent internal 

so encoding of the frame or by unidirectional pre- 

dictive encoding based on progressive scan- 
ning frames which have been encoded, and 
second encoding means (15, 17, 18, 19, 20) for 
performing predictive encoding of each remain- 

55 ing field of said video signal other than said 

fields which are converted to progressive scan- 
ning frames, using, as reference frames for 
said predictive encoding, selected ones of said 
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progressive scanning frames which precede 
and succeed said each remaining field along 
the time axis. 

The video encoding apparatus according to claim 1 , s 
wherein said second encoding means performs 
scanning line decimation processing of each said 
progressive scanning frame which is selected as a 
reference frame, to convert said progressive scan- 
ning frame to have the same number of scanning io 
lines as an interlaced scanning field, and performs 
predictive encoding of said each remaining field 
with said field left unchanged in interlaced scanning 
form. 

75 

The video encoding apparatus according to claim 1 , 
comprising inter-picture prediction means (9) for 
deriving prediction signal values based on said pro- 
gressive scanning frames which have been 
encoded, and wherein said second encoding 20 
means comprises decimation means (15) for 

receiving from said inter-picture prediction sec- 
tion (9), in synchronism with execution of 
encoding processing for said each remaining 25 
field, a set of prediction signal values derived 
based on said preceding and succeeding pro- 
gressive scanning frames, 
executing decimation processing of said set to 
remove all prediction signal values other than 30 
sub-sets of prediction signal values which 
respectively correspond to scanning lines of 
said each remaining field, and 
applying remaining ones of said prediction sig- 
nal values of the set in said predictive encoding 35 
of said each remaining field. 

A video decoding apparatus for executing decoding 
of successive code sequences which have been 
generated by encoding respective progressive 40 
scanning frames derived by doubling the scanning 
line density of one in every m fields of an original 
interlaced video signal (where m is an integer of 2 
or greater), with the encoding process using intra- 
frame predictive encoding or unidirectional predic- 45 
tive encoding, and by encoding remaining fields of 
said video signal directly as interlaced scanning 
fields, with the encoding process using bidrectional 
predictive encoding based on preceding and suc- 
ceeding ones of said progressive scanning frames, so 
the video decoding apparatus comprising: 

first decoding means (31, 6, 12, 11, 32, 35) for 
decoding each of said code sequences corre- 
sponding to said progressive scanning frames ss 
by intra-frame decoding of said code sequence 
or by unidirectional predictive decoding of said 
code sequence using decoded ones of said 



progressive scanning frames, and 
second decoding means (38, 39, 40, 41 , 81) for 
performing bidirectional predictive decoding of 
respective code sequences corresponding to 
each of said fields which have been encoded 
as interlaced scanning fields, using as refer- 
ence frames decoded ones of said progressive 
scanning frames which precede and succeed 
the field corresponding to said code sequence 
along the time axis, 

picture reconfiguration means (32, 36, 42) for 
performing decimation of scanning lines of 
each of the progressive scanning frames which 
have been decoded by the first decoding 
means, to obtain a converted field having the 
same number of scanning lines as an inter- 
laced scanning field, and for executing time- 
axis combination of respective ones of said 
convened fields with said fields which are 
derived by said second decoding means, in an 
appropriate sequence for reproducing said 
original interlaced video signal. 

5. The video decoding apparatus according to claim 4, 
wherein said second decoding means performs 
scanning line decimation processing of each said 
progressive scanning frame which is used as a ref- 
erence frame, to convert said progressive scanning 
frame to have the same number of scanning lines 
as an interlaced scanning field, and performs pre- 
dictive encoding of said each field which has been 
encoded as an interlaced scanning field, with said 
field left unchanged in interlaced scanning form. 

6. The video decoding apparatus according to claim 4, 
comprising inter-picture prediction means (35) for 
deriving prediction signal values based on decoded 
ones of said progressive scanning frames, and 
wherein said second encoding means comprises 
decimation means (81) for receiving from said inter- 
picture prediction means (35), in synchronism with 
execution of said decoding processing for a field 
which was encoded as an interlaced scanning field, 
a set of prediction signal values derived based on 
said decoded preceding and succeeding progres- 
sive scanning frames, 

executing decimation processing of said set to 
remove all prediction signal values other than 
sub-sets of prediction signal values which 
respectively corresponding to scanning lines of 
said field, and 

applying remaining ones of said prediction sig- 
nal values of the set in said predictive decoding 
of the code sequence corresponding to said 
field. 

7. A video decoding apparatus for executing decoding 
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of encoded progressive scanning frames derived by 
doubling the scanning line density of one in every m 
fields of an interlaced video signal (where m is an 
integer of 2 or greater), with the encoding process 
using intra-frame encoding or unidirectional predic- 5 
tive encoding, and by encoding remaining fields of 
said video signal directly as interlaced scanning 
fields, the video decoding apparatus comprising 

decoding means (31, 6, 12, 11, 32, 35) for 10 
decoding each of said encoded progressive 
scanning frames by intra-frame decoding or by 
unidirectional predictive decoding using 
decoded ones of said progressive scanning 
frames, to thereby obtain a first series of recon- 15 
structed pictures, 

prediction error decoding means (38, 39, 40) 
for decoding of prediction error values for each 
of said remaining fields which are encoded as 
interlaced scanning fields, interpolation means 20 
(44) for performing over-sampling of said pre- 
diction error values in a vertical scanning direc- 
tion of a field, to derive reconstructed prediction 
error values corresponding ones of said pro- 
gressive scanning frames, inter-picture predic- 25 
tion means (35) for deriving prediction signal 
values for each of said fields which were 
encoded unchanged as interlaced scanning 
fields, using a preceding one and a succeeding 
one of said first series of reconstructed pictures 30 
as reference frames, 

adder means (41) for adding to said prediction 
signal values said reconstructed prediction 
error values corresponding to progressive 
scanning lines, to thereby obtain a second 35 
series of reconstructed pictures, and picture 
reconfiguration means (32, 42) for inserting 
said second series of reconstructed pictures 
into said first series of reconstructed pictures to 
obtain a reconstructed video signal in which all 40 
pictures are expressed as progressive scan- 
ning frames. 

An encoded video recorded medium having 
recorded thereon code expressing interlaced video 45 
information which has been encoded by high-effi- 
ciency encoding of an interfaced video signal, said 
encoding being characterized in that 

one in every m fields of said interlaced video so 
signal (where m is an integer of 2 or greater) is 
converted to a progressive scanning frame 
having double the number of scanning lines 
while remaining fields are left unchanged as 
interlaced scanning fields, ss 
each of the progressive scanning frames is 
either independently internally encoded or is 
encoded by unidirectional prediction encoding 



using as reference frames progressive scan- 
ning frames which have already already been 
encoded, to thereby obtain respective first 
code sequences corresponding to said pro- 
gressive scanning frames, 
each of the remaining fields which are left 
unchanged as interlaced scanning fields is 
encoded by bidirectional prediction using pre- 
ceding and succeeding ones of the encoded 
progressive scanning frames as reference 
frames, to thereby obtain a respective second 
code sequences corresponding to said remain- 
ing fields, and 

said first and second code sequences are 
recorded to constitute said encoded video 
recorded medium. 

9. A method of encoding an interlaced video signal, 
comprising 

converting one field of every m fields (where m 
is an integer of value 2 or greater) of said inter- 
laced video signal to a single progressive scan- 
ning frame which has twice the scanning line 
density of an interlaced field, while leaving 
remaining fields of said video signal 
unchanged as interlaced scanning fields, 
encoding each said progressive scanning 
frame either by independent internal encoding 
of the frame or by unidirectional predictive 
encoding based on progressive scanning 
frames which have been encoded, and 
performing predictive encoding of each the 
remaining fields of said video signal other than 
said fields which are converted to progressive 
scanning frames, using, as reference frames 
for deriving respective sets of prediction signal 
values corresponding to each of said remaining 
fields, selected ones of said progressive scan- 
ning frames which precede and succeed said 
each remaining field along the time axis. 

10. The method according to claim 9, further compris- 
ing: 

performing decimation processing of each of 
said sets of prediction signal values to remove 
respective sub-sets of prediction signal values 
which do not correspond to scanning lines of 
said each remaining field, to thereby obtain a 
set of prediction signal values for use in predic- 
tive encoding of said each remaining field. 

11. A method of decoding successive code sequences 
which have been generated by a first encoding 
process applied to respective progressive scanning 
frames which have been derived by doubling the 
scanning line density of one in every m fields of an 
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original interlaced video signal (where m is an inte- 
ger of 2 or greater), with the first encoding process 
using intra-frame predictive encoding or unidirec- 
tional predictive encoding, and by encoding remain- 
ing fields of said video signal directly as interlaced 5 
scanning fields in a second encoding process, with 
the second encoding process using bkJrectional 
predictive encoding based on preceding and suc- 
ceeding ones of said progressive scanning frames, 
the method comprising: w 



progressive scanning frames derived by doubling 
the scanning line density of one in every m fields of 
an original interlaced video signal (where m is an 
integer of 2 or greater), with the encoding process 
using intra-frame encoding or unidirectional predic- 
tive encoding, and by encoding remaining fields of 
said video signal directly as interfaced scanning 
fields, 

the decoding method comprising: 



executing decoding each of respective code 
sequences corresponding to each of said pro- 
gressive scanning frames by intra-frame 
decoding of said code sequence or by unidirec- 15 
tional predictive decoding of said code 
sequence using decoded ones of said progres- 
sive scanning frames, and 
executing predictive decoding of respective 
code sequences corresponding to each of said 20 
fields which have been encoded as interlaced 
scanning fields, using as reference frames 
decoded ones of said progressive scanning 
frames which precede and succeed the field 
corresponding to said code sequence along 25 
the time axis, 

performing decimation of scanning lines of 
each of the progressive scanning frames which 
have been decoded by the first decoding 
means, to obtain a converted field having the 30 
same number of scanning lines as an inter- 
laced scanning field, and 
combining respective ones of said converted 
fields and said fields which are derived by said 
second decoding means, along the time axis, 35 
in an appropriate sequence for reproducing 
said original interlaced video signal. 

12. The decoding method of claim 11, wherein said 
predictive decoding of a code sequence corre- 40 
spending to a field which has been encoded as 
interlaced scanning fields is performed by 

deriving a set of prediction signal values based 
on decoded progressive scanning frames 45 
which precede and succeed said field along the 
time axis, 

executing decimation processing of said set to 
remove all prediction signal values other than 
sub-sets of prediction signal values which so 
respectively corresponding to scanning lines of 
said field, and 

applying remaining ones of said set of predic- 
tion signal values in said predictive decoding of 
the code sequence corresponding to said field, ss 

13. A method of decoding successive code sequences 
which have been generated by encoding respective 



decoding each of said progressive scan- 
ning frames by intra-frame decoding or by 
unidirectional predictive decoding using 
decoded ones of said progressive scan- 
ning frames, to thereby obtain a first series 
of reconstructed pictures, respective sets 
of prediction error values for each of said 
remaining fields which are encoded as 
interlaced scanning fields, 
performing over-sampling of said predic- 
tion error values in a vertical scanning 
direction of a field, to generate recon- 
structed prediction error values corre- 
sponding to scanning lines of a 
progressive scanning frame, 
deriving, for said each field which was 
encoded as an interlaced scanning field, 
prediction signal values based on selected 
ones of said first series of progressive 
scanning frames which precede and suc- 
ceed said each field along the time axis, as 
reference frames, 

adding said prediction signal values to said 
reconstructed prediction error values, to 
thereby obtain a second series of recon- 
structed pictures, and inserting said sec- 
ond series of reconstructed pictures into 
said first series of reconstructed pictures to 
obtain a reconstructed video signal in 
which all pictures are expressed as pro- 
gressive scanning frames. 
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